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ABSTRACT 


A genuine  small  sample  theory  for  post -stratification  is  developed  in 
this  paper.  This  Includes  the  definition  of  a ratio  estimator  of  the  popula- 
tion mean  Y,  the  derivation  of  Its  bias  and  its  exact  variance  and  a discussion 
of  variance  estimation.  The  estimator  has  both  a within  strata  component  of 
variance  which  is  comparable  with  that  obtained  in  proportional  allocation 
stratified  sampling  and  a between  strata  component  of  variance  which  will 
tend  to  zero  as  the  overall  sample  size  becomes  large.  Certain  optimality 
properties  of  the  estimator  are  obtained.  The  generalization  of  post-stratl- 
flcatlon  from  the  simple  random  sampling  to  post-stratification  used  in  conjunction 
with  stratification  and  multi-stage  designs  is  discussed. 
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"An  Exact  Small  Sample  Theory  for  Post-Stratification" 

by 

D.  C.  Doss*,  H.  0.  Hartley^  and  G.  R.  Somayajulu^ 

1.  Introduction 

As  is  well  known,  strata  are  defined  as  nonoverlapping  and  exhaustive 
subsets  of  the  units  of  a population  with  the  following  properties: 

(a)  The  total  number  of  units  in  stratum  h of  the  population 
is  known, 

(b)  It  is  possible  to  identify  in  advance  of  sampling  the  stratum 
h to  which  each  unit  belongs  and  prescribed  sample  sizes 

^ S’"®  drawn  from  stratum  h. 

"Post  strata"  differ  from  strata  in  the  sense  that  condition  (b)  is  no 
longer  satisfied.  However,  it  is  assumed  that  after  sampling  it  is 
possible  to  identify  for  each  elementary  unit  the  post-stratum,  h,  to 
which  it  belongs. 

The  literature  on  post-stratification  is  almost  exclusively  confined 
to  the  case  of  a simple  random  sa(nple  of  size  n drawn  from  the  population. 
If  we  define  by  n^^  the  number  of  units  which  "happen  to  fall"  into  a post- 
stratum h then  the  n^^  become  random  variables  following  a hypergeometric 
distribution.  It  is  well  known  that  the  literature  on  post-stratification 
is  essentially  confined  to  a situation  where  the  probability  that  = 0 
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can  be  assumed  to  be  negligible.  Accordingly,  the  estimator  of  the 
population  mean  Y considered  is  of  the  form 

L N.  _ 

Tl'h  (’•') 

where  the  N^/N  are  the  known  post-strata  proportions  in  the  population  and 
is  the  ordinary  sample  mean  of  the  units  falling  into  post-stratum  h 
provided  n^^  ^ 1 . The  definition  of  yjl^  for  the  case  = 0 varies.  As  is 
well  known,  if  the  probability  that  n^  = 0 is  negligibly  small  the  above 
estimator  is  approximately  unbiased  and  has  a variance  which  is  approxi- 
mately equal  to  that  of  a stratified  estimator  for  proportional  allocations. 

If  the  above  approximations  are  accepted  it  would  follow  that  all  the 
benefits  derivable  from  stratification  and  proportional  allocation  can  be 
attained  by  the  above  device  of  post-stratification.  Unfortunately, 
experiencp  with  pnst-stratifiration  when  the  sample  size  n is  comparatively 
small  and  the  number  of  strata  is  comparatively  large  is  distinctly 
disappointing. 

It  is  therefore  the  purpose  of  this  paper  to  develop  a genuine  small 
sample  theory  for  post-stratification.  This  will  include  the  precise 
definition  of  the  estimator  of  Y,  the  derivation  of  its  bias  and  its  exact 
variance  and  a discussion  of  variance  estimation.  It  is  not  surprising 
that  our  findings  will  show  that  our  post-stratified  estimator  will  have 
both  a within  strata  component  of  variance  which  is  comparable  with  that 
obtained  in  proportional  allocation  stratified  sampling  but  also  a between 
strata  component  of  variance  which  will  tend  to  0 as  the  overall  sample 
size  n becomes  large  to  an  order  which  is  0(n"*).  The  derivation  of  our 
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compact  and  exact  variance  formulas  for  both  components  of  variance 
enables  us  to  derive  certain  optimality  properties  of  our  estimator 
together  with  recommendations  for  sampling  strategies. 

In  the  last  section  we  also  discuss  the  generalization  of  post- 
stratification for  survey  designs  that  are  more  realistic  than  a simple 
random  sample.  These  include  post-stratification  used  in  conjunction 
with  stratification  and  multi-stage  designs.  However,  these  generalize 
tions  are  only  discussed  in  generality  and  not  spelled  out  in  detail. 
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2.  A Ratio  Estimator 

Throughout  this  paper,  we  consider  only  a simple  random  sample  of 
size  n from  a population  of  size  N with  L strata  (L  l 2).  However, 
generalizations  of  the  design  are  considered  in  section  6.  Defining  the 
"indicator  variables" 


1 if  at  least  one  unit  of  the  sample  of  size  n 
is  in  stratum  h 


(2.1) 


(0  otherwise, 

we  start  with  an  unbiased  estimator  of  Y of  the  form 


y ' “hVh^E(%)  (2-2) 

where  = N^/N,  E(a^)  = 1 " ( summation  extends  over 
all  strata.  When  a^  = 0,  can  be  defined  arbitrarily  as  a constant, 
say,  Y^,  the  population  mean  of  stratum  h,  since  the  corresponding  term 
in  (2.2)  is  zero.  The  unbiasedness  of  this  estimator  fcllcivs  frs.T. 

E(y)  = E(E(y))  = E(ia^P^7^/E(a^))=  eP^Y^  = Y (2.3) 

where  E is  the  conditional  expectation  given  (ni,..,,n, ) and  E is  the 
2 L 1 

expectation  over  (ni,...,n, ).  Similarly,  we  define  variances  V and  V and 

■-  1 2 

covariances  Cov  and  Cov.  Note  a similarity  of  our  estimator  to  the  well* 
1 2 

known  Horvitz -Thompson  estimator  of  Y in  a random  sample  with  unequaH 
probabilities  of  selection. 

A serious  drawback  of  y is  that  its  variance  depends  on  the  origin 
of  the  y values.  To  demonstrate  this  we  consider  a translation  of  each 
y to  y + c where  c is  an  arbitrary  constant  and  the  estimator  of  7 + c 
becomes 

(y^c)  = J:aj^P,^(y^  + c)/E(aj^)  = y + cx  (2.4) 


i ^ 
; ( 


I 

I 
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where 


(2-5) 


and  the  variable  = 1 for  all  population  units.  Incidentally,  x is 

an  unbiased  estimator  of  7 = 1.  Now  clearly  we  have  for  the  variance 

V(y  + c)  = V(y)  + c2v(x)  + 2cCov(y,x)  (2.6) 

and  it  is  obvious  that  V(y  + c)  can  be  made  arbitrarily  large  by  increasing 

c sufficiently.  This  is  due  to  the  fact  that  x is  not  a constant. 

In  order  to  eliminate  the  dependence  of  the  variance  on  the  origin 

of  the  y values  we  turn  our  attention  to  a ratio  estimator  of  Y which  is 

def i ned  by  _ 

Za.  P.y. /E(a.  ) 

R . - y/x  = (2-7> 

The  variance  of  R is  unaffected  by  translation  of  y values  since  from  (2.2) 
(y^c)/x  = (y/x)  + c.  (2.8) 


Now  the  ratio  estimator  (2.8)  will  in  general  be  slightly  biased. 
However  in  the  particular  case  where  all  strata  proportions  are  equal, 
our  ratio  estimator  is  shown  to  be  unbiased  in  Appendix  II.  In  other 
cases  the  bias  of  R as  an  estimator  of  Y is  of  the  order  of  magnitude 
O(P^qJJ)  or  0(Pj^qJJ^M  where  Qh  ~ ^ " ^h  Appendix  I).  Therefore,  even 
for  a moderate  sample  size  n the  bias  is  negligible  provided  the  Pj^  are 
greater  than  or  equal  to  cn"\ 

If  the  number  of  strata  is  large  and  all  P^  are  small,  while  n is 
moderate,  we  show  in  Appendix  I that  the  bias  is  of  order  0(n’L“^)  or 
0(n^L  ).  Once  again  the  bias  is  negligible.  The  bias  can  be  exactly 

evaluated  for  a small  number  of  strata  by  direct  computation. 


I 


3.  The  Exact  Variance  of  the  Ratio  Estimator 

There  are  two  components  of  variance  resulting  from  the  well-known 


relation 


V(R)  = V(E(R))  + E(V(R)) 
12  12 


(3.1) 


where  again  E and  V are  conditional  expectations  and  variances  given  a set 
2 2 

of  n.  and  E,  V are  expectations  and  variances  over  the  n.  . The  terms 
nil  n 

V(E(R))  and  E(V(R))  are  called  the  between  strata  component  and  the  within 
12  12 

strata  component  of  variance  of  R and  denoted  as  V(R)g  and  V(R)j^  respectively. 
First  we  derive  the  between  strata  component  in  a compact  form  which 


requires  recasting  R in  a simple  form  as 

where  •'  = 


k - 

■ L 


k-  I 


(3.2) 


(3.3) 


Since  ib^^  = 1 , we  obtain  for  any  fixed  h 

^ Cov(b.  ..b.)  = E(  Z b.  .b.)  - E(  l b.  .)E(bJ 
h7h  ^ ^ h7h  ” ^ h7h  ^ ^ 

= E{(1  - b^)bj,}  - E(1  - b^)E(bj,)  (3.4) 

= -v(b,). 


Si  nee 


E(R)  = tb,E(7,)  = . 


we  find  that,  by  virtue  of  (3.4) 


(3.5) 
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L _ L 

V(R)  = V(E(R))  = I V(b.)Yj  + E Cov(b.  ..bjY.  ,Y. 
« 12  h=li  ^ ^ h7h=l  1 h h h h 


L L _ L 

= l/2{  Z y(bj^)Yj  + zy(b  .)Y2  +2  z Cov(b.  ..bjY.  .7,} 
h=ii  ^ h=li  " " h7h=l  1 h h h h 


= 1/2  z Cov(b.  ,.bJ(Y2  + 72  - 27J.  .) 
h7h=1  1 h h h h h h 


L (E^,)E(b,)  -E(b,.b,)) 
h'|ih=1  ^ 


<%■  - ''h>" 


The  within  strata  component  of  variance  R is  given  by 


(3.6) 


V(R)n  = E(V(R))  = E(£bJV(y^))  = E(zbJ((l/n^)  - (l/(y)Sg) 


= EE'(bJ/nh)E(,h)S-;  - «(bJ)S^/R, 


(.1.7) 


where  E'  stands  for  the  conditional  expectation  given  n^^  >_  1 and  is 
the  population  mean  square  of  stratum  h,  i.e. 


N. 


(3.8) 


In  the  particular  case  where  all  are  equal,  the  components  of 
variance  of  R reduce  (see  Appendix  II)  to  the  very  simple  forms 

V(R)b  = {E(l/v)  - (1/L)}S§  (3.9) 

and  V(R)y  = {E'(l/v2nj^)E(aj,)  - E(1/v2)(L/N))eS^  (3.10) 


-8- 


where  v = the  number  of  strata  represented  in  the  sample, 
S|  = j:(yJ  - LY2)/(L  - 1). 


and 


E(a,)  = 1 


/N((L  - I 
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4.  The  Efficiency  of  R 

In  order  to  reduce  the  variance  of  the  estimator  of  Y stratified 
sampling  is  employed  in  practice  with  different  allocation  schemes 
of  the  sample.  In  particular  wh^t!  the  population  strata  means  differ 
considerably  from  each  other  and  the  patterns  of  strata  variances 

h 

differ  for  different  content  items,  the  scheme  of  proportional 
allocation  is  used  to  eliminate  this  variability.  But,  if  stratified 
sampling  is  not  possible  because  of  (b),  then  it  is  of  interest  to 
find  an  estimator  of  T based  on  post-stratification  which  would 
minimize  the  between  strata  variation  and  at  the  same  time  would  not 
increase  the  within  strata  variation  "unduly". 

A reasonable  class  of  unbiased  estimators  of  Y based  on  post- 
stratification that  can  be  computed  from  the  sample  is  given  by 

ro(nj^)Pj^y,^,/E(n(n^)  (4,1) 

where  is  any  mathematical  function  defined  for  all  values  of  n^ 

and  is  such  that  g(0)  = 0 and  E(g(n^))  0.  This  class  clearly  includes 

the  sample  mean  y by  letting  g(n^)  = n^.  All  estimators  of  (4.1)  with 
the  exception  of  y suffer  from  the  same  drawback  as  y defined  by  (2.1), 
that  is,  the  increase  in  variance  through  translation.  Hence  the  logical 
step  to  eliminate  this  effect  is  to  consider  ratio  estimators  analogous 
to  (2.7). 

It  is  shown  in  Appendix  II  that  the  ratio  estimators  obtained  from 
(4.1)  are  unbiased  and  our  estimator  R given  by  (2.6)  minimizes  the 
between  strata  component  of  variance  in  this  generalized  class  of 
estimators  (4.1)  when  all  strata  sizes  are  equal.  In  the  case  where  all 
strata  sizes  are  not  equal,  there  exists  no  ratio  estimator  that  minimizes 
the  between  strata  component  of  variance  if  9(nj^)  is  required  not  to 
depend  on  the  population  strata  means  or  strata  variances. 
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The  between  strata  component  of  variance  of  R,  V(R)g,  is  always 
smaller  than  that  of  y and  (as  is  seen  in  Appendix  I)  V(R)g  is  of  an 
exponential  order  of  magnitude  o'"  approaches  zero 

much  more  rapidly  than  V(y)g  which  is  of  the  order  0(n"*). 

When  the  number  of  strata  L is  large,  all  are  small,  and  n is 
moderately  large,  it  is  seen  in  Appendix  I that  an  approximate  V(R)n  is 

D 

“2  2 “3 

of  order  0(nL  ) or  0(n  L ).  This  implies  that  in  a situation  where 
the  usual  estimator  (1.1)  is  at  its  worst,  R has  a negligible  between 
strata  component  of  variance. 

Turning  now  our  attention  to  the  within  strata  component  of  variance 
we  consider  an  approximation  to  V(R)yj  since  the  exact  variance  is 
analytically  intractable.  In  Appendix  I we  show  that  to  terms  of  order 
0(n”^)  we  have  that 

V(R)j^  = ^ ■ 'h' 

which  clearly  approaches  the  variance  of  the  estimator  used  in  strati- 
fication with  proportional  allocation  for  large  n,  i.e. 

V(R)y  = J:(P^o2/n).  (4.3) 

The  asymptotic  result  (4.3)  is  also  correct  for  large  L if  all  P^  are 
small  and  n is  moderately  large. 

The  relative  efficiency  of  R as  compared  with  the  estimator  of  Y 


employed  in  stratified  sampling  with  proportional  allocation  approaches 
asymptotically  1 if  either  n is  large  or  L is  large  (so  that  all  P^^  are 
small)  while  n is  moderately  large. 


5.  The  Estimation  of  the  Variance  of  R 


An  unbiased  estimator  of  V(R)  is  given  by 


V(R)  = E a.a.  . 
h7h=l  " " 


{E(b^.)E(b^)  - E(b^.b^)) 


2ETa;;vr 


‘•hh' 


(5.1) 


h=i  h h h h=1  ^ ^ 


where  t^^^,  is  an  unbiased  estimator  of  given  ^ 0 


and 


n^,  ^ 0.  We  may  use  the  estimator 


'hh'  ' Vh'<<\  - J'h'*'  - <'h'"h>  ■ <'^^"h'» 


when  we  define  s^  = z (y^.  - y^)/(n^  - 1) 


(5.2) 

(5.3) 


and  $2,  by  replacing  h by  h ' in  (5.3).  Finally  in  (5.1)  we  define  d.^  by 


^h  = 


0 if  n.  = 0 
h 

1/nj^  ^ 1 . 


(5.4) 


The  computation  of  {(y^^  - y^i)^  - (^h^'^h^  ~ (5.2)  is  only 

required  if  both  n^  ^ 1 and  n^^,  >_  1 since  otherwise  = 0*  However, 

the  definitions  of  and  s^,  in  (5.1)  and  (5.3)  require  that  both  ^ 2 
and  n^,  ^ 2.  In  case  n^  = 1 and/or  n^^,  = 1 methods  of  estimating 
variances  from  single  units  per  stratum  have  to  be  employed  (see  e.g. 
Hartley,  Rao,  and  Kiefer  (19  )). 

If  the  number  of  strata  is  not  small,  then  it  becomes  very  tedious 
to  compute  E(bj^)E(b^,)  - E(bj^,bj^)  in  which  case  an  approximation  is 
provided  in  Appendix  I equations  (A-10)  to  (A-14). 
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6.  Post-stratification  for  More  General  Survey  Designs 

We  confine  ourselves  here  to  a brief  outline  of  the  main  general 
theory  of  post-stratification  for  stratified  multi-stage  designs.  We 
shall  utilize  the  theory  of  "Domain  Estimation"  (see  e.g.  Hartley  (1959)) 
by  identifying  post  strata  with  "domains  of  study". 

Denote  by  y.  the  characteristic  attached  to  the  i^^  last  stage 

A 

unit  and  by  Y(y^)  the  standard  unbiased  estimator  of  the  population 
total  of  the  y^.  The  estimator  Y(y^)  is  a well  defined  linear  function 
of  the  y^  in  the  sample.  Define  now  the  domain  variables 

fy^  if  unit  i is  in  domain  h 


h^i 


if  unit  i is  not  in  domain  h 


(6.1) 


and 


h^i  * 


if  unit  i is  in  domain  h 
if  unit  1 is  not  in  domain  h 


(6.2) 


and  consider  the  subset  of  samples  for  which  at  least  one  last  stage  unit 
falls  into  domain  h.  Denote  by 

= Pr{at  least  one  last  stage  unit  in  domain  h).  (6.3) 


For  this  subset  of  samples  the  estimate  Y(j^x^)  of  the  number  of  units  in 


domain  h will  be  greater  than  zero  since  Y(j^x^)  is  a linear  function  of 


the  j^x^  with  positive  coefficients.  Accordingly,  we  can  for  this  subset 
of  samples  compute  the  ratio  estimate  of  the  population  domain  mean  in 
the  form 

- Y(jv.) 

(6.4) 


h^  " 


Y(hXi) 


which  will  have  a "technical  bias"  given  by 
Biasj^  - 


) ! 


(6.5) 


-13- 


wherr^Cov  is  a conditional  covariance  applicable  to  the  above  subset 

of  samples  and  is  the  total  number  of  last  stage  units  in  domain  h. 

It  is  reasonable  to  assume  that  Cov(|^y,Y(^x. ))  will  be  zero  or  small 

since  the  estimate  of  the  mean  value  of  the  y characteristic  f.  vl  is 

h 

unlikely  to  be  correlated  with  the  estimate  of  the  number  of  units 
Y(hXi)  falling  into  domain  h. 

We  finally  turn  to  the  post-stratified  estimates  of  the  population 
mean  and  define  in  analogy  to  (2.2)  the  post-stratified  estimates 

^ ‘ “h'’h<V’h>-  * ' («•« 

fl  if  at  least  one  last  stage  unit 
is  in  domain  h 


where 


0 if  there  is  not  at  least  one  last 
stage  unit  in  domain  h. 


(6.7) 


Finally  we  define  the  double  ratio  estimator 

R = Wx  (6.8) 

which  isour  post-stratified  estimator  of  the  population  mean. 

The  main  difficulty  about  using  (6.6)  and  (6.8)  is  the  computation 

of  the  II defined  by  (6.3)  which  would  require  the  knowledge  of  the 

domain  sizes  in  each  last  but  one  stage  unit.  However  for  many  survey 

designs  it  is  possible  to  compute  approximations  to  the  w.  as  we  shall 

h 

illustrate  below: 

Assume  that  the  last  stage  units  are  sampled  with  equal  probability 
and  without  replacement  and  use  the  index  j to  denote  the  last  but  one 
stage  units.  Denote  by  p(s)  the  probability  that  a sample  s of  last  but 
one  stage  units  has  been  drawn  by  the  specified  survey  design.  Denote 
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by  iHj  the  specified  number  of  last  stage  units  to  be  drawn  from  the 
last  but  one  stage  unit  if  in  s.  Denote  by  the  proportion  of  last 
stage  units  in  the  last  but  one  stage  unit  which  are  in  domain  h 
and  by  1 fpc's)  the  probability  iS' given  by 

(6.9) 


ifu  = 1 - J:p(s)  n = 1 - Q" 
^ s j in  s ^ 


where  is  an  average  value  of  the  and  m is  an  average  value  of 
the  total  overall  last  stage  sample  size. 

Improvements  in  the  computation  of  the  and  the  spelling  out 
of  the  bias  and  variance  of  R will  be  left  to  subsequent  communications. 
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APPENDIX  I 


1.  The  order  of  magnitude  of  the  bias  and  V(R)  as  n 

O 

When  the  numerator  of  defined  by  (3.3)  Is  written  as 


Vh/«“h>  - '■h'l  S) 


where 


'h  - W - 

we  Immediately  observe,  for  any  positive  Integer  m,  that 


(A.l) 


(A.  2) 


(-1)  (1  - aj^)  + (_)  aj^ 


1-Qh 


(A.  3) 


where  ■ 1 - and  E(aj^)  * 1 - q||  under  the  assumption  that  the  strata 
sizes  are  sufficiently  large  for  approximating  the  hypergeometrlc  distribution 
of  by  a binomial.  Similarly  the  denominator  of  bj^  can  be  written  as 
L L 

^ ^ P.  (1  + e,  ) - 1 + e 

k-1  k-1 

where  e “ Z It  Is  not  difficult  to  see  that 

E(ej^)  - 0,  E(e”)  - (-1)“qJ  + qJ”  /(I  - . O(qJJ), 


(A.  A) 


E(e)  - 0,  E(e“) 


^ /k. 


rj^+. . .+rj^“m 


Supposing  that  |e|  < 1,  we  are  able  to  expand  b,  in  the  form 

h 

“ ^h  (^h  • ^^h^^  - e^)  + ...},  (A. 6) 

from  which  we  obtain 

E(bh  - P^)  - Pjj{  E(e^  - c^z)  + E(ej^e^  - e^)  + ...}. 


(A.  7) 
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Since  It  can  be  shown  that  E(e“  - is  of  order  of  magnitude  0(p”qJJ) 

0(Pj^  f the  bias  in  R is  given  by 

E(r  bj^7^)  - Y -E{E(b^)  - P^}  Y, 


(A.  8) 


which  is  of  order  0(P^q")  or  0(P,q“'*’^). 

h n h h 


Using  these  results  in  (3.6)  for  computing  the  between  strata  component 


of  variance  we  obtain 

E(b,)E(b^,)  - E(b^b^,)  . OCpJq”)  or 

We  therefore  conclude  that  V(R)_  ~ 0(P^q“)  or  0(P..q”^^). 

B h h h^h 


(A.9) 


We  thereby  obtain  a first  approximation  to  V(R)_  to  order  0(P^q")  or  0(P 

B h h h h 

by  omitting  the  terms  in  e,  e^,  with  degree  higher  than  2 in  the  expansion 
of 

E(bh)E(bh,)  - E(bj^b^,)  - EKbj^  - E(bj^)}{bj^,  - E(bj^,)}] 

■ ^h^h’ 

" ■ Cov(ej^,,e)} 

(A.  10) 


where 


V(e)  - E p2  + r - E(a^^)E(a^^,) 

k-1  ^ k’/k-l  E(ak)E(aj^,)  k k' ’ 


CovCCh,  Cj^,)  - 


E(ah>®h’^  - E(aj^)E(aj^;) 
E(a^)E(aj^,) 


1-E(a.  ) E<»h®V>  - E(a.  )E(a.  ) 

“ E(a,)  ^ ^ 4 “ ^(VECa^) 


E(V 


(«>) 


(A. 11) 


(A. 12) 


(A. 13) 


(A. 14) 
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E(Wi)  - E(aj^)E(aj^,) 


n 


(N-\) 

n ' n 


(A. 15) 


2.  The  order  of  magnitude  of  the  bias,  V(R)_  and  V(R)„  when  L -v  » p ■+  q 

B w ’ h 

and  n Is  moderately  large. 

Since  it  is  not  difficult  to  prove  that  the  bias  and  V(R)_  are  of  the  same 

B 

order  as  before,  we  shall  concentrate  on  V(R)g  and  V(R)^.  Without  going  into 
detail  we  can  obtain  a first  approximation  as  follows : 

V(R)  - V(y/x)  - V(y  - Yx)  (A. 16) 


L V(a.) 


+ 


+ 


L Cov(fL  ,a  ')  _ _ _ _ 

L tt'd/y  E(.^)  - E(.^/l.„)) 
h-l 


(A.  17) 


Assuming  as  before  that  the  strata  sizes  are  sufficiently  large  for  approximation 
of  the  hypergeometrlc  distribution  by  a multinomial,  we  can  write 

"V  ■ oj  - ’’S’-  <A-w 

■ <1  - ‘’h  - ’’h-’"  - 

and 

E(aj^)  - (1  - qJJ).  (A.  20) 


We  now  consider  cases  where  L is  large,  all  P.  are  small  while  n is 

n 

moderate.  Omitting  all  the  terms  in  with  higher  degree  than  2 in  the 
expansion  of  V(aj^)  and  Cov(aj^,aj^)  and  E*  (l/n^^)  we  obtain 
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and 


V(aj^)  » n(Pj^  - - I - 1)  Pj. 

Cov(ah.a^.)  * -n 


(A.  21) 
(A. 22) 

(A. 23) 


Which  on  substituting  In  V(R)  and  omitting  the  finite  population  correction, 

reduce  after  much  simplification  to  the  form 

L PJ.  L P.Y.  , 

V(R)  - n Z P ^ K ---  -] 

h-1  ^ 1 - q"  h»l  ^ (1-Qh) 


3n(n-l)  , ‘■J  ’'h  , t "'■hC  Wh  * ¥ fh*  2 

' ^ h h'l 


(A. 24) 


We  now  infer  that 


and 


V(R)g  ~ 0(n  l'^)  or  0(n^  l“^  , 


L P.  a . 

V(R)„  i Z + 0(  l"-^) 


^ h-1  " 

which  follows  from 

qJJ"^  (1  - Ph)"'^  . l-(n-l)Pj^  + p2 


(1-qJJ)^  {l-(l-P^)"}^ 


*^h  2 h^ 


2 2 
" '’h 


(A.  25) 


(A. 26) 


(A. 27) 


3.  The  order  of  magnitude  of  V(R)^  as  n “ 


For  large  n it  has  been  shown  by  Stephan  (1945)  that  to  terms  of  order 


-2 


h nV 


V(R)„  - 2 


. 1_  _ ^h  °h 
2 ^ , n 


n(l-Q“)  n‘  1-q: 


(A. 28) 


Then 


(A. 29) 
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whlch,  after  some  simplification. 


(A. 37) 

where  „ 

EY  2 - L 

urzT) — 

(A. 38) 

Since 

V(Cj^)  - E(c^)  - {E(c^)}^  - E(c^)  - 1/L 

(A. 39) 

■we  concentrate  on  minimizing 


E(c2) 


1 

r 


Z E(c‘)  - E 


(- 


-)-r^ 


E{g(n^^)}^ 


) P(n 


1* 


* “l  > 


(A.  40) 


where  the  first  summation  extends  over  all  possible  values  of  n^  and 

P(nj^ ) is  the  probability  of  getting  (n^^,  . . . , n^^  ) In  a sample  of 

size  n.  For  any  particular  value  (n^,  n'^^  ) with  V positive  n^  values,  we 

can  see,  by  Cauchy-Schwartz  Inequality, 


I{g(nj;) 

{Eg(n^)}^ 


(A. 41) 


and  equality  is  attained  when  g(nj^)  “ arbitrary  nonzero  constant.  Without  loss 
of  generality  we  assume  g(njj)  - 1 for  all  nj^jtO  which  minimize  E(c^)  , i.e.  R 
minimizes  the  between  stratum  component  of  variance.  In  fact 
V(i),  . (E(i)  - i )s| 


(A. 42) 
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